Prosodic Word Grouping in Mandarin TTS System

نویسندگان

  • Qing Guo
  • Endong Xun
  • Nobuyuki Katae
چکیده

This paper reports the methodology and results of prosodic word grouping for a Mandarin TTS system developed by the Fujitsu Laboratories. In view of any inner prosodic word break will make speech unintelligible or unnatural, a new prosodic word grouping framework is proposed. The word segmentation result can be regarded as an initial prosodic word sequence with grids inserted into each word boundary. The target of prosodic word grouping is to remove those grids that are assessed needlessly in the prosodic word level. Several prosodic word grouping methods can work together because each makes its own decision. As one of three methods used in our TTS system, a binary prosodic tree method is also proposed in this paper. Lastly, experiment results and discussion are presented.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An NN-based Approach to Prosodic for Synthesizing English Words Em

In this paper, a neural network-based approach to generating proper prosodic information for spelling/reading English words embedded in background Chinese texts is discussed. It expands an existing RNN-based prosodic information generator for Mandarin TTS to an RNN-MLP scheme for Mandarin-English mixed-lingual TTS. It first treats each English word as a Chinese word and uses the RNN, trained fo...

متن کامل

Prosodic grouping and relative clause disambiguation in Mandarin

The study discusses the role of prosodic grouping in the Mandarin Relative Clause attachment disambiguation. The grouping effect is explored under the Implicit Prosody Hypothesis (IPH) from four aspects of sentence processing experiments: default production, contrast production, online processing, as well as offline processing. It is found that (1) the length of RC greatly impacts ambiguity res...

متن کامل

A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS

Prosodic word is a basic rhythmic unit of Mandarin Chinese Speech. It is one of the most important factors determining the naturalness of the generated speech by a TTS system. This paper investigates the problem of predicting Chinese prosodic words from word sequence. First, we examine the patterns of Chinese prosodic words and investigate the key features for prediction. Then a baseline model ...

متن کامل

High-Quality Prosody Generation in Mandarin Text-to-Speech System

A text-to-speech (TTS) synthesizer is a computer-based system that can automatically read text aloud. Fujitsu is developing a Mandarin TTS system using state-of-the-art technologies. The prosodic structure of synthesized text provides important information for making synthetic speech produced by a TTS system more natural and understandable. This paper describes a global probability estimation m...

متن کامل

طراحی و ارزیابی یک مدل بازسازی گفتار به روش هم‌گذاری واحدهای حساس به بافت نوایی

This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. Thesyllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006